Information retrieval with distributed databases: analytic models of performance
Similar resources
High-Performance, Distributed Information Retrieval
We propose an information retrieval (IR) system called KEYNET which is a high-performance, distributed search engine for locating information objects in a large subject-specific corpus. The system could handle up to a few million information objects with performance at a level of hundreds of queries per second (with current workstation technology). We have developed a prototype KEYNET system fo...
Performance Analysis of Distributed Information Retrieval Architectures
Large document collections are increasingly available over the network. In order for users to access these collections, information retrieval systems must provide coordinated, concurrent, and distributed access. Since even unified information retrieval (IR) systems place heavy demands on system resources, it is unclear how performance will be affected as user demand increases and the distribute...
Quantitative Models for Performance Enhancement of Information Retrieval from Relational Databases
We consider a collection of relational databases that are accessed by users with different profiles. We develop optimal retrieval strategies and system designs to present to the users their required information from relevant databases. We use queuing-theory-based analytical models as well as simulations to obtain system performance measures.
Bridging Information Retrieval and Databases
For bridging the gap between information retrieval (IR) and databases (DB), this article focuses on the logical view. We claim that IR should adopt three major concepts from DB, namely inference, vague predicates and expressive query languages. By regarding IR as uncertain inference, probabilistic versions of relational algebra and Datalog yield very powerful inference mechanisms for IR as well...
Information retrieval from biological databases.
As discussed earlier in this book, GenBank was created in response to the explosion in sequence information resulting from a panoply of scientific efforts such as the Human Genome Project. To review, GenBank is an annotated collection of all publicly available DNA and protein sequences and is maintained by the National Center for Biotechnology Information (NCBI). As of this writing, GenBank con...
Journal
Journal title: IEEE Transactions on Parallel and Distributed Systems
Year: 2004
ISSN: 1045-9219
DOI: 10.1109/tpds.2004.1264782